Search CORE

8 research outputs found

Sentimenduen analisia euskaraz: lexiko-mailatik erlaziozko diskurtso-egiturarako proposamena

Author: Alkorta Jon
Gojenola Galletebeitia Koldobika
Iruskieta Quintian Mikel
Publication venue: 'UPV/EHU Press'
Publication date: 01/01/2016
Field of study

Nowadays, opinion texts play an important role, in fact, people read opinions before they do an activity, buy a product or take a decision. However, the amount of opinion text is increasing rapidly and reading all opinions about a subject is unfeasible. ‘Sentiment analysis’ is a part of Natural Language Processing whose aim is to process opinion texts. This work is a part of ‘sentiment analysis’ and presents a first approximation to the assignment of polarity to texts written in Basque. Rhetorical Structure Theory has been used to assign different weights to text spans and an analysis has been performed at the lexical level. This method has been compared with other approaches and the results are promising

Archivo Digital para la Docencia y la Investigación

Advances in monolingual and crosslingual automatic disability annotation in Spanish

Author: Andrés Santamaría Edgar
Atutxa Salazar Aitziber
Goenaga Azcarate Iakes
Gojenola Galletebeitia Koldobika
Publication venue: BMC
Publication date: 01/06/2023
Field of study

Background Unlike diseases, automatic recognition of disabilities has not received the same attention in the area of medical NLP. Progress in this direction is hampered by obstacles like the lack of annotated corpus. Neural architectures learn to translate sequences from spontaneous representations into their corresponding standard representations given a set of samples. The aim of this paper is to present the last advances in monolingual (Spanish) and crosslingual (from English to Spanish and vice versa) automatic disability annotation. The task consists of identifying disability mentions in medical texts written in Spanish within a collection of abstracts from journal papers related to the biomedical domain. Results In order to carry out the task, we have combined deep learning models that use different embedding granularities for sequence to sequence tagging with a simple acronym and abbreviation detection module to boost the coverage. Conclusions Our monolingual experiments demonstrate that a good combination of different word embedding representations provide better results than single representations, significantly outperforming the state of the art in disability annotation in Spanish. Additionally, we have experimented crosslingual transfer (zero-shot) for disability annotation between English and Spanish with interesting results that might help overcoming the data scarcity bottleneck, specially significant for the disabilities.This work was partially funded by the Spanish Ministry of Science and Innovation (MCI/AEI/FEDER, UE, DOTT-HEALTH/PAT-MED PID2019-106942RB-C31), the Basque Government (IXA IT1570-22), MCIN/AEI/ 10.13039/501100011033 and European Union NextGeneration EU/PRTR (DeepR3, TED2021-130295B-C31) and the EU ERA-Net CHIST-ERA and the Spanish Research Agency (ANTIDOTE PCI2020-120717-2)

Archivo Digital para la Docencia y la Investigación

Erroreak automatikoki detektatzeko tekniken azterlana eta euskararentzako aplikazioak

Author: Díaz de Ilarraza Sánchez Arantza
Gojenola Galletebeitia Koldobika
Oronoz Anchordoqui Maite
Publication venue: Servicio Editorial de la Universidad del País Vasco/Euskal Herriko Unibertsitatearen Argitalpen Zerbitzua
Publication date: 01/01/2009
Field of study

In this article, we study the techniques used for detecting errors in Natural Language Processing (NLP). We classify the techniques according to their approach (symbolic or empirical), and then we describe them in depth. Following that, we describe the systems we have developed for detecting syntactic errors in Basque, by using that technique as a criterion for the classification of those systems, and enhancing it with examples

Archivo Digital para la Docencia y la Investigación

Universidad del País Vasco / Euskal Herriko Unibertsitatea: Ciencia - Portal de revistas digitales de la UPV/EHU

Oinarrizko baliabideak eta beren aplikazioa aditzen azpikategorizazio-informazioaren erauzketan eta erroreen tratamenduan

Author: GOJENOLA GALLETEBEITIA Koldobika
Publication venue
Publication date: 01/01/2000
Field of study

Euskal Doktorego Tesien Bilduma - Repositorio de Tesis Doctorales

Euskararako analizatzaile sintaktiko estatistikoa hobetzeko teknikak

Author: Bengoetxea Kortazar Kepa Xabier
Gojenola Galletebeitia Koldobika
Publication venue: 'UPV/EHU Press'
Publication date: 01/01/2016
Field of study

This paper presents a set of experiments to improve the results of the statistical syntactic analyzers for Basque. The present work has examined different techniques: i) tree transformations, ii) stacking, and iii) combinations of the output of several parsers. All the results have been obtained using gold morphosyntactic tags coming directly from the treebank and using automatic mophosyntactic tags coming from morphological analysis and disambiguation module.; Artikulu honetan euskararako analizatzaile sintaktiko-estatistikoen emaitzak hobetzeko helburuarekin egindako esperimentu-multzoa aurkezten da. Lan honetan teknika ez-berdinak aztertzen dira: i) zuhaitz-transformazioak, ii) analizatzaileen pilaketa, eta iii) analizatzaile-modelo desberdinen irteeren konbinazioa. Emaitza guztiak zuhaitz-bankutik zuzenean hartutako urre-patroiko ezaugarri morfosintaktikoak erabiliz eta analisi morfologiko eta desanbiguatze-moduluetatik hartutako ezaugarri morfosintaktiko automatikoak erabiliz egin dira

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital para la Docencia y la Investigación

EUSLEM: Un lematizador/etiquetador de textos en euskara

Author: Aldezabal Roteta Izaskun
Alegría Loinaz Iñaki
Artola Zubillaga Xabier
Díaz A.
Ezeiza Ramos Nerea
Gojenola Galletebeitia Koldobika
Publication venue: Sociedad Española para el Procesamiento del Lenguaje Natural
Publication date: 01/01/1994
Field of study

Secretaría de Estado de Cultura

Dependentzia Unibertsalen eredura egokitutako euskarazko zuhaitz-bankua

Author: Aranzabe Urruzola María Jesús
Atutxa Salazar Aitziber
Bengoetxea Kortazar Kepa Xabier
Díaz de Ilarraza Sánchez Arantza
Goenaga Azcarate Iakes
Gojenola Galletebeitia Koldobika
Uria Garin Larraitz
Publication venue: 'UPV/EHU Press'
Publication date: 01/01/2019
Field of study

Hizkuntzaren Prozesamenduan kokatzen den Dependentzia Uniber-tsalen proiektuaren helburua da hainbat hizkuntzatan sortu diren dependentzia-ereduan oinarritutako zuhaitz-bankuak etiketatze-eskema estandar berera egokitzea. Artikulu honetan, eredu horretara automatikoki egokitu den euskarazko zuhaitz-bankua aurkez-ten da; halaber, egokitzapen-lan hori nola gauzatu den deskribatzen da eta, azkenik, ho-rretan oinarrituta, azaltzen da zer antzekotasun eta zer desberdintasun diren jatorrizko zuhaitza-bankuaren eta Dependentzia Unibertsalen eredura egokitutako zuhaitz-ban-kuaren artean.; In the Natural Language Processing research area, the aim of the Uni-versal Dependencies project is to convert dependency based treebanks developed in different languages into the same standard tagging scheme. This article presents the automatic conversion of the previously existing Basque treebank into this universal tagging scheme. This work describes how the conversion process has been carried out and highlights the similarities and differences between the original Basque treebank and the Universal Dependency based version of it

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital para la Docencia y la Investigación

Universidad del País Vasco / Euskal Herriko Unibertsitatea: Ciencia - Portal de revistas digitales de la UPV/EHU

Exaeskalarako sare-interkonexioen diseinurako helburu-aniztasuneko optimizazioa

Author: Aranzabe Urruzola María Jesús
Atutxa Salazar Aitziber
Bengoetxea Kortazar Kepa Xabier
Díaz de Ilarraza Sánchez Arantza
Goenaga Azcarate Iakes
Gojenola Galletebeitia Koldobika
Uria Garin Larraitz
Publication venue: 'UPV/EHU Press'
Publication date: 01/01/2019
Field of study

Exascale performance will be delivered by systems composed of millions of inter-connected computing cores. The way these computing elements are connected with each other (network topology) has a strong impact on many performance characteristics. In this work we propose a multi-objective optimization- based framework to explore possible network topologies to be implemented in the EU-funded ExaNeSt project. The modular design of this system’s inter-connect provides great flexibility to design topologies optimized for specific performance targets such as communications locality, fault tolerance or energy-consumption. The generation proce-dure of the topologies is formulated as a three-objective optimization problem (minimizing some topological characteristics) where solutions are searched using evolutionary techniques. The analysis of the results, carried out using simulation, shows that the topologies meet the required performance objectives. In addition, a comparison with a well-known topology reveals that the generated solutions can provide better topological characteristics and also higher throughput in almost all evaluated scenarios.; Exaeskala errendimendua milioika kalkulu-nukleoz osaturiko sistemak erabi-liz lortuko da. Elementu horiek konektatzeko moduak (sare-topologia) izugarrizko eragina du hainbat errendimendu ezaugarritan. Lan honetan, sare-topologiak diseinatzeko helburu-anizta-suneko optimizazioan oinarritutako ingurune bat proposatzen dugu, EBk finantzatuta ExaNeSt proiektuan garatzen ari garena. ExaNeSt sistemaren sarearen modulartasunari esker sare-topolo-gia ezberdinak diseinatu ditzakegu hainbat errendimendu-helburu optimizatzeko; esaterako, in-guruko komunikazioak, hutsegite-tolerantzia eta energia-kontsumoa. Topologiak sortzeko pro-zesua optimizazio bidez gauzatzen da (sare-topologiaren hainbat ezaugarri minimizatuz) teknika ebolutiboak erabilita. Simulazio bidezko emaitzen analisiak sortutako topologiek errendimen-du-helburuak betetzen dituztela erakusten du. Gainera, sare-topologia ezagun batekin egindako konparazioan ikus daiteke gure proposamenak sortzen dituen sareek propietate topologiko ho-beak dauzkatela eta, aldi berean, errendimendu handiagoa lortzen dutela

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital para la Docencia y la Investigación